智能论文笔记

Attend to the Right Context: A Plug-and-Play Module for Content-Controllable Summarization

Wen Xiao , Lesly Miculicich , Yang Liu , Pengcheng He , Giuseppe Carenini

分类：自然语言处理

2022-12-21

Content-Controllable Summarization generates summaries focused on the given controlling signals. Due to the lack of large-scale training corpora for the task, we propose a plug-and-play module RelAttn to adapt any general summarizers to the content-controllable summarization task. RelAttn first identifies the relevant content in the source documents, and then makes the model attend to the right context by directly steering the attention weight. We further apply an unsupervised online adaptive parameter searching algorithm to determine the degree of control in the zero-shot setting, while such parameters are learned in the few-shot setting. By applying the module to three backbone summarization models, experiments show that our method effectively improves all the summarizers, and outperforms the prefix-based method and a widely used plug-and-play model in both zero- and few-shot settings. Tellingly, more benefit is observed in the scenarios when more control is needed.

translated by 谷歌翻译

Natural Language to Code Generation in Interactive Data Science Notebooks

Pengcheng Yin , Wen-Ding Li , Kefan Xiao , Abhishek Rao , Yeming Wen , Kensen Shi , Joshua Howland , Paige Bailey , Michele Catasta , Henryk Michalewski

分类：自然语言处理

2022-12-19

Computational notebooks, such as Jupyter notebooks, are interactive computing environments that are ubiquitous among data scientists to perform data wrangling and analytic tasks. To measure the performance of AI pair programmers that automatically synthesize programs for those tasks given natural language (NL) intents from users, we build ARCADE, a benchmark of 1082 code generation problems using the pandas data analysis framework in data science notebooks. ARCADE features multiple rounds of NL-to-code problems from the same notebook. It requires a model to understand rich multi-modal contexts, such as existing notebook cells and their execution states as well as previous turns of interaction. To establish a strong baseline on this challenging task, we develop PaChiNCo, a 62B code language model (LM) for Python computational notebooks, which significantly outperforms public code LMs. Finally, we explore few-shot prompting strategies to elicit better code with step-by-step decomposition and NL explanation, showing the potential to improve the diversity and explainability of model predictions.

translated by 谷歌翻译

MASTER: Multi-task Pre-trained Bottlenecked Masked Autoencoders are Better Dense Retrievers

Kun Zhou , Xiao Liu , Yeyun Gong , Wayne Xin Zhao , Daxin Jiang , Nan Duan , Ji-Rong Wen

分类：自然语言处理

2022-12-15

Dense retrieval aims to map queries and passages into low-dimensional vector space for efficient similarity measuring, showing promising effectiveness in various large-scale retrieval tasks. Since most existing methods commonly adopt pre-trained Transformers (e.g. BERT) for parameter initialization, some work focuses on proposing new pre-training tasks for compressing the useful semantic information from passages into dense vectors, achieving remarkable performances. However, it is still challenging to effectively capture the rich semantic information and relations about passages into the dense vectors via one single particular pre-training task. In this work, we propose a multi-task pre-trained model, MASTER, that unifies and integrates multiple pre-training tasks with different learning objectives under the bottlenecked masked autoencoder architecture. Concretely, MASTER utilizes a multi-decoder architecture to integrate three types of pre-training tasks: corrupted passages recovering, related passage recovering and PLMs outputs recovering. By incorporating a shared deep encoder, we construct a representation bottleneck in our architecture, compressing the abundant semantic information across tasks into dense vectors. The first two types of tasks concentrate on capturing the semantic information of passages and relationships among them within the pre-training corpus. The third one can capture the knowledge beyond the corpus from external PLMs (e.g. GPT-2). Extensive experiments on several large-scale passage retrieval datasets have shown that our approach outperforms the previous state-of-the-art dense retrieval methods. Our code and data are publicly released in https://github.com/microsoft/SimXNS

translated by 谷歌翻译

Chatbots for Mental Health Support: Exploring the Impact of Emohaa on Reducing Mental Distress in China

Sahand Sabour , Wen Zhang , Xiyao Xiao , Yuwei Zhang , Yinhe Zheng , Jiaxin Wen , Jialu Zhao , Minlie Huang

分类：自然语言处理

2022-09-21

对心理健康支持的需求不断增长，强调了对话代理在全球和中国作为人类支持者的重要性。这些代理可以增加可用性并降低心理健康支持的相对成本。提供的支持可以分为两种主要类型：认知和情感支持。关于该主题的现有工作主要集中在采用认知行为疗法（CBT）原理的构造药物上。此类代理根据预定义的模板和练习来运行，以提供认知支持。但是，使用此类药物对情绪支持的研究是有限的。此外，大多数建设的代理商都以英语运作，强调了在中国进行此类研究的重要性。在这项研究中，我们分析了表情符疾病在减少精神痛苦症状方面的有效性。 Emohaa是一种对话剂，通过基于CBT的练习和指导性对话提供认知支持。它还通过使用户能够发泄所需的情绪问题来支持情感上的支持。该研究包括134名参与者，分为三组：Emohaa（基于CBT），Emohaa（Full）和控制。实验结果表明，与对照组相比，使用Emohaa的参与者在精神困扰症状方面的改善得到了更大的改善。我们还发现，添加情感支持剂对这种改善，主要是抑郁和失眠有互补的影响。根据获得的结果和参与者对平台的满意，我们得出结论，Emohaa是减少精神困扰的实用和有效工具。

translated by 谷歌翻译

Detecting Political Biases of Named Entities and Hashtags on Twitter

Zhiping Xiao , Jeffrey Zhu , Yining Wang , Pei Zhou , Wen Hong Lam , Mason A. Porter , Yizhou Sun

分类：机器学习

2022-09-16

美国的意识形态分裂在日常交流中变得越来越突出。因此，关于政治两极分化的许多研究，包括最近采取计算观点的许多努力。通过检测文本语料库中的政治偏见，可以尝试描述和辨别该文本的两极分性。从直觉上讲，命名的实体（即，用作名词的名词和短语）和文本中的标签经常带有有关政治观点的信息。例如，使用“支持选择”一词的人可能是自由的，而使用“亲生生命”一词的人可能是保守的。在本文中，我们试图揭示社交媒体文本数据中的政治极性，并通过将极性得分分配给实体和标签来量化这些极性。尽管这个想法很简单，但很难以可信赖的定量方式进行这种推论。关键挑战包括少数已知标签，连续的政治观点，以及在嵌入单词媒介中的极性得分和极性中性语义含义的保存。为了克服这些挑战，我们提出了极性感知的嵌入多任务学习（PEM）模型。该模型包括（1）自制的上下文保护任务，（2）基于注意力的推文级别的极性推导任务，以及（3）对抗性学习任务，可促进嵌入式的极性维度及其语义之间的独立性方面。我们的实验结果表明，我们的PEM模型可以成功学习极性感知的嵌入。我们检查了各种应用，从而证明了PEM模型的有效性。我们还讨论了我们的工作的重要局限性，并在将PEM模型应用于现实世界情景时的压力谨慎。

translated by 谷歌翻译

Entity-based SpanCopy for Abstractive Summarization to Improve the Factual Consistency

Wen Xiao , Giuseppe Carenini

分类：自然语言处理

2022-09-07

尽管最近的抽象性摘要在自动评估指标上取得了成功，但生成的摘要仍然与源文档呈现事实不一致。在本文中，我们专注于实体级别的事实不一致，即减少生成的摘要与源文档之间的不匹配实体。因此，我们提出了一种基于实体的新型跨度机制，并通过全球相关成分探索其扩展。四个摘要数据集的实验结果表明，跨度可以有效地改善实体级别的事实一致性，而单词级别和实体级别的显着性基本上没有变化。该代码可在https://github.com/wendy-xiao/entity基于基础上找到

translated by 谷歌翻译

Map Container: A Map-based Framework for Cooperative Perception

Kun Jiang , Yining Shi , Benny Wijaya , Mengmeng Yang , Tuopu Wen , Zhongyang Xiao , Diange Yang

分类：机器人

2022-08-28

合作感知的想法是从多辆车之间的共同感知数据中受益，并克服单车上车载传感器的局限性。但是，由于本地化不准确，通信带宽和模棱两可的融合，多车信息的融合仍然具有挑战性。过去的实践通过放置精确的GNSS定位系统来简化问题，手动指定连接的车辆数量并确定融合策略。本文提出了一个基于地图的合作感知框架，名为MAP容器，以提高合作感的准确性和鲁棒性，最终克服了这个问题。概念“地图容器”表示地图是将所有信息转换为地图坐标空间的平台，并将不同的信息源合并到分布式融合体系结构中。在拟议的MAP容器中，考虑使用GNSS信号和传感器功能和地图功能之间的匹配关系以优化环境状态的估计。对仿真数据集和房地车平台的评估结果验证了所提出的方法的有效性。

translated by 谷歌翻译

See Finer, See More: Implicit Modality Alignment for Text-based Person Retrieval

Xiujun Shu , Wei Wen , Haoqian Wu , Keyu Chen , Yiran Song , Ruizhi Qiao , Bo Ren , Xiao Wang

分类：计算机视觉

2022-08-18

基于文本的人检索旨在根据文本描述找到查询人员。关键是学习视觉文本模式之间的常见潜在空间映射。为了实现这一目标，现有的作品采用细分来获得明确的跨模式对齐方式或利用注意力来探索显着对准。这些方法有两个缺点：1）标记交叉模式比对很耗时。 2）注意方法可以探索显着的跨模式对齐，但可能会忽略一些微妙而有价值的对。为了缓解这些问题，我们为基于文本的人检索引入了一个隐式视觉文本（IVT）框架。与以前的模型不同，IVT利用单个网络来学习两种模式的表示形式，这有助于视觉文本相互作用。为了探索细粒的对准，我们进一步提出了两个隐式语义比对范式：多级比对（MLA）和双向掩码建模（BMM）。 MLA模块在句子，短语和单词级别上探索了更精细的匹配，而BMM模块旨在挖掘视觉和文本模态之间的\ textbf {更多}语义对齐。进行了广泛的实验，以评估公共数据集中提出的IVT，即Cuhk-Pedes，RSTPREID和ICFG-PEDES。即使没有明确的身体部位对准，我们的方法仍然可以达到最先进的表现。代码可在以下网址获得：https：//github.com/tencentyouturesearch/personretrieval-ivt。

translated by 谷歌翻译

AI for CSI Feedback Enhancement in 5G-Advanced and 6G

Jiajia Guo , Chao-Kai Wen , Shi Jin , Xiao Li

分类：人工智能

2022-06-30

第三代合作伙伴项目已开始研究2021年的第18版。人工智能（AI）空气界面是第18版的关键特征之一，其中选择了用于渠道状态信息的AI（CSI）反馈增强作为代表性。用例。本文提供了5G助长和6G中CSI反馈增强的AI的全面概述。首先介绍和讨论了5G效率的CSI反馈增强AI的范围，包括高架降低，准确性提高和渠道预测。然后，介绍并比较了三个代表性CSI反馈的代表性框架，包括单方面隐式反馈，基于双面自动编码器的隐式反馈和双面显式反馈。最后，已经确定和讨论了CSI反馈增强的AI标准化考虑因素，尤其是重点是评估，复杂性，协作，概括，信息共享，具有渠道预测的联合设计和互惠性。本文为基于AI的CSI反馈增强的标准化研究提供了指南。

translated by 谷歌翻译

GLF-CR: SAR-Enhanced Cloud Removal with Global-Local Fusion

Fang Xu , Yilei Shi , Patrick Ebel , Lei Yu , Gui-Song Xia , Wen Yang , Xiao Xiang Zhu

分类：计算机视觉

2022-06-06

可以通过合成孔径雷达（SAR）图像来缓解云去除任务的挑战，这些图像可以穿透云覆盖。但是，光学图像和SAR图像之间的较大域间隙以及SAR图像的严重斑点噪声可能会导致基于SAR的基于SAR的云去除，从而导致性能退化。在本文中，我们提出了一种新型的基于全局融合的云去除（GLF-CR）算法，以利用SAR图像中嵌入的互补信息。利用SAR信息的力量促进云清除需要两个方面。首先是全球融合，指导所有本地光窗口之间的关系，以维持与其余无云区域一致的回收区域的结构。第二个本地融合，传输嵌入在SAR图像中的互补信息，该信息与多云区域相对应，以生成缺失区域的可靠纹理细节，并使用动态过滤来减轻斑点噪声引起的性能退化。广泛的评估表明，所提出的算法可以产生高质量的无云图像，并且在SEN12MS-CR数据集中的PSNR方面，其增益约为1.7 db，超过最先进的云去除算法。

translated by 谷歌翻译